Towards data - intensive testing of abroad - coverage LFG grammar Jonas
نویسنده
چکیده
This paper addresses the problem that manual checking of output representations becomes impracticable in extensive tests during grammar development or in data-intensive applications of the grammar, like grammar-based lexicon acquisition from corpora. A method of annotating the sentences to be parsed with target expressions is proposed, using the LFG formalism itself to specify the expressions, such that the check of the actual solutions against the target speciication is performed by the standard LFG constraint solver. Dieser Beitrag widmet sich dem Problem, daa ein manuelles Durchgehen der Ausgabereprrsentationen in grooangelegten Tests innerhalb der Gram-matikentwicklung oder in datenintensiven Anwendungen der Grammatik wie der grammatikbasierten Lexikonakquisition aus Korpora unmmglich wird. Ein Verfahren wird vorgeschlagen, nach dem die zu parsenden Testsstze mit Zielausdrrcken annotiert werden. Die Ausdrrcke werden im LFG-Formalismus speziiziert, so daa das Standard-Constraintllsungsverfahren im LFG-Parser die berprrfung der tatsschlichen LLsungen gegennber der Zielspeziikation erledigt.
منابع مشابه
Towards data-intensive testing of a broad-coverage LFG grammar
This paper addresses the problem that manual checking of output representations becomes impracticable in extensive tests during grammar development or in data-intensive applications of the grammar, like grammar-based lexicon acquisition from corpora. A method of annotating the sentences to be parsed with target expressions is proposed, using the LFG formalism itself to specify the expressions, ...
متن کاملSina Zarrieß and Jonas Kuhn: Paraphrases in LFG-based broad-coverage semantics
This paper adresses the problem of modelling paraphrases in a deep linguistic processing framework where the meaning construction component is based on an LFG grammar. We present a syntax-based approach to paraphrase extraction that operates on shallow dependency analyses in a parallel corpus. By means of an XLE-based conversion routine, we generate transfer rules for the automatically acquired...
متن کاملCross-Lingual Induction for Deep Broad-Coverage Syntax: A Case Study on German Participles
This paper is a case study on cross-lingual induction of lexical resources for deep, broad-coverage syntactic analysis of German. We use a parallel corpus to induce a classifier for German participles which can predict their syntactic category. By means of this classifier, we induce a resource of adverbial participles from a huge monolingual corpus of German. We integrate the resource into a Ge...
متن کاملTIGER TRANSFER Utilizing LFG Parses for Treebank Annotation
Creation of high-quality treebanks requires expert knowledge and is extremely time consuming. Hence applying an already existing grammar in treebanking is an interesting alternative. This approach has been pursued in the syntactic annotation of German newspaper text in the TIGER project. We utilized the large-scale German LFG grammar of the PARGRAM project for semi-automatic creation of TIGER t...
متن کاملImproving data-driven dependency parsing using large-scale LFG grammars
This paper presents experiments which combine a grammar-driven and a datadriven parser. We show how the conversion of LFG output to dependency representation allows for a technique of parser stacking, whereby the output of the grammar-driven parser supplies features for a data-driven dependency parser. We evaluate on English and German and show significant improvements stemming from the propose...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 1998